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We reconstruct the shape of the primordial power spectrum (PPS) using a smoothing spline. Our 
adapted smoothing spline technique provides a complementary method to existing efforts to search 
for smooth features in the PPS, such as a running spectral index. With this technique we find no 
significant indication with WMAP first-year data that the PPS deviates from Harrison-Zeldovich 
and no evidence for loss of power on large scales. We also examine the effect on the cosmological 
parameters of the additional PPS freedom. Smooth variations in the PPS are not significantly 
degenerate with other cosmological parameters, but the spline reconstruction greatly increases the 
errors on the optical depth and baryon fraction. 



PACS numbers: 



I. INTRODUCTION 



The standard ACDM cosmological model with a 
Harrison-Zeldovich primordial spectrum successfully pre- 
dicts a vifide range of current observations (e.g.P|), im- 
plying that just five cosmological parameters may de- 
scribe the universe's composition and evolution. Current 
data can begin to probe the universe's primordial inho- 
mogeneities. A power-law primordial power spectrum fits 
the cosmic microwave background (CMB) and large-scale 
structure observations very well There remains, how- 
ever, the tantalizing possibility that there could be struc- 
ture in the primordial power spectrum (PPS). Theories 
of the universe's birth, such as infiation or the ekpyrotic 
model, predict specific primordial power spectra |^; 
different models of inflation predict different deviations 
from a pure power law. Thus, the statistical signifi- 
cance of scale-dependent features in the PPS can support 
and/or rule out models of the very early universe and test 
the inflationary paradigm. 

While we cannot directly measure the PPS, the flrst- 
year WMAP observations (J] of the cosmic microwave 
background (CMB) provide detailed information about 
the matter power spectrum as far back in time as wc can 
see. The CMB power spectra depend on the PPS through 
a well-known transfer function. Large-scale structure 
data can also probe the PPS. At present, most of the 
large-scale structure statistical power is on scales smaller 
than those probed by CMB experiments, and such analy- 
sis is complicated by non- linear growth of structure, bias, 
and redshift space distortions. Here, we do not consider 
large-scale structure, and focus instead on the statisti- 
cal significance of features detected in the PPS on CMB 
scales, such as a running of the spectral index and a de- 
crease in large-scale power, e.g. |a,IEI3- We use a non- 
parametric approach that does not assume any particular 
PPS model, but merely a smooth PPS fmiction. 
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Typically, when fitting a model to data, one simply 
finds the parameters that minimize the error. When re- 
covering a function from discrete data (such as the CMB 
angular power spectra), however, there are potentially 
infinite degrees of freedom and finite data. There is no 
limit on the number of parameters describing the func- 
tion, so it is possible to choose a function that interpo- 
lates the data to make the error nearly zero. Since the 
data is noisy, clearly such an interpolating function will 
have features created by the noise that do not exist in 
the true underlying function. Yet if one uses too few 
parameters (or the wrong choice of parameters), the fit- 
ting function could miss features that are actually there. 
In nonparametric inference, the data themselves can de- 
termine just how many effective degrees of freedom to 
give a function to recover the signal without "fitting the 
noise" . For background on nonparametric inference in 
astrophysics, see 

Some of the previous work to probe the shape of the 
PPS relies on parametric models predicted by inflation 
01 , piecewise linear reconstructions ^| , or linearly in- 
terpolating between sliding bins 0| . To invert the trans- 
fer function convolution and directly re-create the PPS 
is quite challenging 0, ITl. [l3| , as computational limi- 
tations force cut-offs or sampling that cause wiggles in 
the reconstructed PPS. Wavelets provide a rigorous non- 
parametric method to search for sharp features as well 
as trends in the PPS 0, Q but how to choose the 
number of wavelets to use is a subjective choice. Prin- 
ciple component analysis of the CMB can detect impor- 
tant departures from a scale-invariant PPS 0,0], but 
the PPS model used can bias the recovered cosmolog- 
ical parameters. Clearly, multiple methods are needed 
to cross-check each other and contribute their respective 
strengths to our understanding. 

We explore a powerful nonparametric method to test 
the statistical significance of smooth deviations from a 
scale- invariant, flat PPS. The technique we use, comple- 
mentary to the above methods, is a smoothing spline. 
Smoothing splines are a well-developed method used fre- 
quently in nonparametric function reconstruction |l8j |. 
but this is their first application to the PPS. 

As a side benefit, our analysis allows us to investigate 
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degeneracies between the PPS shape and the cosmolog- 
ical parameter values recovered from CMB data. There 
is concern (e.g., jl3|) that structure in the PPS could 
change WMAP's cosmological parameter results, possi- 
bly even eliminating dark energy. By allowing the PPS 
function more degrees of freedom, we can test the robust- 
ness of the cosmological parameters to flexibility in the 
PPS. 

In the sections below, we explain the smoothing spline 
method and how we apply it to reconstruct the PPS. We 
then demonstrate the application of our method on mock 
data sets, and present reconstructed primordial power 
spectra from the WMAP first-year data. We compare the 
smoothing spline model-testing results with the Akaike 
and Bayesian information criteria T^, '2^. Lastly, we 
explore how increasing the flexibility in the PPS affects 
the recovered cosmological parameters and their errors. 

II. SMOOTHING SPLINE 

Here we review the smoothing spline technique (see, 
e.g., [3). 

Suppose one wants to recover a function f{x) based 
on measurements of /, denoted /, at a discrete set of 
points Xi. In the smoothing spline method, one adds a 
roughness penalty to the chi-square error, and minimizes 
their sum: 

s^f^ ^ j2 ^f^'^ -J^"^^))" + A r {nx)fdx. (1) 
i=i "'^1 

The called "knots", and Ci denotes the standard 

deviation at each point. 

The integral of the function's second derivative squared 
is called the "roughness penalty" and disfavors jagged 
and bumpy functions that "fit the noise" . If f{x) is a gen- 
eral function with infinite degrees of freedom, minimizing 
the error alone will recover features created by random 
noise in the data, as well as the smooth underlying func- 
tion's features. The roughness penalty effectively reduces 
the degrees of freedom to only allow smooth functions. 
The smoothing parameter, A, determines the amount of 
roughness allowed; larger A implies a smoother function. 
As A goes to infinity, the problem becomes one of lin- 
ear regression, and as A goes to zero, the problem be- 
comes one of interpolation ("connect the dots"). Clearly, 
a method is required for choosing A, since different A give 
different answers for /. Cross validation, discussed later, 
provides a rigorous statistical technique for choosing the 
optimum smoothing parameter. 

Notice that the first term in S depends only on the val- 
ues of f at the points of measurement. It can be shown 
(e.g., jis p that the function that minimizes the rough- 
ness penalty for fixed values of f{xi) is a cubic spline: an 
interpolation of points via a continuous piecewise cubic 
function, with continuous first and second derivatives. 
The continuity requirements uniquely determine the in- 
terpolating spline, except at the boundaries. Here, we use 



the assumption-independent boundary condition that the 
third derivative also be continuous at the first and last 
interior knots. 



III. APPLYING THE SMOOTHING SPLINE TO 
THE PRIMORDIAL POWER SPECTRUM 

The smoothing spline is a natural technique to recon- 
struct the PPS, since we expect it to be a smooth func- 
tion. As we do not have direct measurements of the PPS 
nor unlimited computing power, we cannot bring the full 
power of the smoothing spline method to bear on this 
problem. Nonetheless, it is an informative tool that can 
only grow more illuminating with better data and com- 
puting power. The method allows us to give the PPS 
much more freedom than parametric models, and mea- 
sure how the data take advantage of this extra parameter 
space. 

The data we use to recover the PPS are the first-year 
WMAP measurements of the CMB temperature (TT) 
and temperature-polarization cross-correlation (TE) an- 
gular power spectra as a function of multipole moment 
i II, |21| . Large-scale structure data could provide more 
information, and while the largest scales probed by large- 
scale structure now overlap with the small scales probed 
by CMB, we leave an analysis that includes galaxy sur- 
veys data for future work. Here, we concentrate on devel- 
oping the method to be applied in the cosmological con- 
text to reconstruct the primordial power spectrum, where 
there is a complicated non-linear function connecting 
the PPS to the observed angular power spectrum of the 
CMB. In addition, we are interested in features on scales 
probed by CMB data, about which there has been con- 
troversy in the literature |ili,|lli|li|ll|ll|ll[ll[l3. 
We write the primordial spectrum in terms of the scalar 
metric fiuctuation, x, as 

i2nfP{k, t)S^{k + k') = ixik, t)x{k', t)), (2) 

where () denote the ensemble average. The dimensionless 
PPS in an isotropic universe is then, 

nk) - ^Pik)k\ (3) 

We generalize the smoothing spline method, described 
in the previous section, to apply to the PPS. In place 
of the chi-square error, we use the log likelihood func- 
tion as computed by the code provided by the WMAP 
team The theory Ci depend on P(fc) and cosmolog- 
ical parameters via the transfer function, Ci = T{'P{k))^ 
computed using the CMBFAST code 23] modified to ac- 
cept a spline PPS. We place the roughness penalty on 
'P(fc) as a function of In(fc) because we wish to discern 
deviations from a power-law PPS. 

The penalized error we minimize (compare to Eq.(Q) 
is thus, 
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5(7-") = — In cosmological parameters) + XF{V), 

(4) 

with the roughness penalty given by, 

F{r) = / {P"{lnk)fdlnk. (5) 

J\n fc„i„ 

Since the transfer function is a convolution over 'P{k), 
we cannot choose the knots so each knot corresponds 
univocally to a data point. Due to computational limi- 
tations, we cannot choose nearly as many knots as data 
points. Thus, we are only sensitive to smooth, large- 
scale features and the general shape of the primordial 
power spectrum. However, if there are sharp features 
that are sufficiently statistically significant, this method 
may still detect a deviation from scale-invariance. We 
space the knots equally in ln(A;) between fcmin = 1 x 10^^ 
and fcmax = 0.2 [S^l- Note that there is a trade-off be- 
tween the number of knots and the A values: reducing the 
number of knots forces the function to be smoother, and 
thus has a similar effect to increasing A (and vice- versa). 

The smoothing spline method thus allows for broad 
variations in the PPS. It is general enough to include 
shapes produced by a running of the spectral index or a 
gentle cutoff at low multipoles. The smoothing param- 
eter determines the above-mentioned balance to recover 
signal without fitting noise. 



detailed description. MCMC thus maps the likelihood 
surface to as much accuracy as needed. 

The details of our implementation follow. Our MCMC 
code computes and records S and the roughness penalty, 
and maximizes —S rather than the log likelihood. De 
Boor's code calculates the cubic spline that interpo- 
lates the knot values. To choose the optimum step size 
for fastest convergence, we find the covariance matrix for 
a preliminary, short chain and take MCMC steps along 
the eigenvectors of the covariance matrix with eigenvalue 
step sizes. To check convergence and mixing, we use 
cusum path plots J28|| calibrated from and compared with 
a converged WMAP chain l23| . 

Usin g impo rtance sampling (re-weighting the Markov 
chain) |25l |2y| , we can vary A slightly without re- running 
a given chain. Since we store the value for F as well 
as S for each point in the chain, we can re-weight each 
point's contribution to the posterior density by a fac- 
tor of e(^°^^)^, where Ao is the smoothing parameter for 
the initial chain, so the resulting density corresponds to a 
smoothing parameter A. This procedure only works when 
g(Ao-A)F ^ g-s^ j£ ^Yxe shape of as a function of pa- 
rameter space changes significantly, the original Markov 
chain will not have adequately explored regions of lowest 
S for the new A, and importance sampling does not give 
meaningful results. 



B. Cross validation 



A. Markov Chain Monte Carlo 

For the smoothing spline PPS reconstruction, we as- 
sume a flat universe and treat as free parameters the 
physical matter density flmh^, the physical baryon den- 
sity flbh^, the sound horizon's angular size A, and the 
optical depth r (through the parameter Z = e~^'^), e.g., 
[2^. In addition to these four cosmological parame- 
ters, our parameter space includes the value of P(fc) at 
each knot. Thus, the parameter space is nine dimen- 
sions for a five-knot spline, twenty-one dimensions for a 
seventeen-knot spline. We seek the point in parameter 
space that minimizes Eq.Q. It is computationally im- 
possible to test all combinations of the cosmological pa- 
rameters and values of P{k) at the knots. Therefore, we 
use the Markov Chain Monte Carlo (MCMC) method to 
sample the parameter space, map the log likelihood (or, 
in this case, S) surface as a function of parameter space, 
find the extremum, and locate the one-sigma bounds. 

MCMC evaluates and records the value of S at discrete 
points in such a way that the density of the recorded 
points in parameter space is proportional to the poste- 
rior distribution. As it evaluates each point, it decides 
whether or not to add the coordinates of that point and 
its S value to a list. This list is called a "Markov chain" . 
We use the Metropolis-Hastings algorithm to select which 
points to put on the list and how many times each point 
is repeated. See |22, section 3, or [2^ |2^ for a more 



The smoothing spline requires an objective method to 
choose the optimum value of A to recover the underlying 
function. Cross-validation (CV) (see, e.g., 0) quantifies 
the notion that if we have found the correct underlying 
function, the result should be insensitive to more data. 
The most rigorous form of CV is referred to as "leave out 
one": run the analysis leaving out one data point, get 
the optimum function, and compute the error between 
that data point and the recovered function's value at that 
point. Then do this for each data point, sum the resulting 
errors, and choose the A that minimizes this sum. 

One-fold CV is too computationally expensive for us. 
We have data points at 899 ^'s, so we would have to run 
899 Markov chains if we left each £ out once. Because the 
transfer function lies between the PPS and the data, we 
cannot apply the calculation shortcut in Instead, 
we use ^-fold cross validation, where N = 899 is the 
number of data points (^'s). We run a Markov chain 
using only half the data (every-other £) with some initial 
A. We then take the coordinates of every point from that 
Markov chain {V knots and cosmological parameters), 
and compute the log likelihood of the corresponding Ci 
with respect to the other half of the data. In Cqy, for each 
point. Next we repeat this process with the two halves 
of the data switching roles. Using importance sampling, 
we choose the best-fit V for each half of the data for any 
nearby A, and note the corresponding In £cv with respect 
to the other half. Summing these two InCcv gives the 
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"CV score" for a given A. The best A is the one with the 
highest CV score. 

Since the roughness penalty is independent of cos- 
mological parameters, we keep them fixed for the CV 
Markov chains and vary only the PPS |4(](|. We check 
how much fixing the cosmological parameters affects the 
CV results by running CV for two different sets of fixed 
cosmological parameters, and once allowing the cosmo- 
logical parameters to vary. 

An important assumption of CV is that the data "left 
out" to perform CV is not correlated with the remain- 
ing data. If the data are correlated, the smoothing pa- 
rameter will be under-estimated by CV, and a general- 
ization of CV for correlated errors should be used [2^ . 
When performing ^-fold CV with full-sky CMB exper- 
iments such as WMAP, the omitted data points are not 
strongly correlated to the remaining data points. In fact, 
for the WMAP data, neighboring points are negligibly 
correlated (see figure (14) of j^])- 



IV. TESTING THE SMOOTHING SPLINE 
METHOD ON MOCK DATA 

To assess the performance of our method, we test it 
on mock data sets created from select primordial power 
S£ectra using the publicly available HEALPix software 
pl| . The power spectra we choose to examine are A) the 
best-fit running of the spectral index (as — —0.047) with 
the best-fit WMAP parameters, B) a running one-sigma 
steeper than the WMAP best-fit (a^ = -0.087), and C) a 
scale-invariant power spectrum with a sharp cutoff at fc = 
0.002 Mpc (431 . We choose these PPS because a running 
and a cutoff are two features that have been claimed to 
be consistent with the data SimSE'sl, and 
we would like to determine how well our method would 
detect them. 

The Ci errors are non-gaussian, especially at low 
and the WMAP team uses a sum of a lognormal and a 
normal distribution to approximate the non-gaussianity 
[22|. Thus for case A we reconstruct the spectrum twice: 
in one case using the WMAP likelihood code, in the other 
case using the ideal likelihood (Eq.(4) of ^2|) for £ < 100. 
In all other cases we use the WMAP likelihood code. Our 
reconstruction uses a five-knot smoothing spline and zero 
lambda. 

We plot the spectral index as a function of fc, ?T,(fc) 
using the definition [4^ . 



n(fc) = 1 + 



ln(7^(fc)) - ln(7^(fco)) 
In /c — In kn 



(6) 



with fco = 0.002Mpc^^ for cases A and B. Case A is 
shown figures (Q) and ||2Jl, where the former used the 
WMAP likelihood approximation, and the latter used 
the ideal likelihood calculation for ^ < 100 ignoring noise 
and coupling due to sky cut. Figure ||2Jl shows the one- 
sigma running (case B). In all the figures, the dashed line 
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FIG. 1: The spectral index n(fc) of the reconstructed PPS 
from the mock data with a running of as = —0.047 and the 
best-fit values for the WMAP running spectral index model 
for the rest of the cosmological parameters. The reconstruc- 
tion used a smoothing spline with five knots equally spaced 
in In k and no roughness penalty, and the likelihood approxi- 
mation used by the WMAP team. The dark line denotes the 
mean, the shaded region denotes one standard deviation, and 
the dashed line denotes the fiducial PPS. 



denotes the fiducial PPS, and the solid line the mean re- 
constructed spectrum. Our method successfully recovers 
the PPS over the WMAP-sensitive scales, and detects a 
noticeable deviation from n = 1. The comparison of the 
two case A reconstructions, figures ^ and ||2J), shows 
that the likelihood approximation does not significantly 
affect the reconstructed spectrum. The bend at low k 
in figure ijjjl is due to the prior requiring 7'(fc) to be 
greater than ten at the knots. Clearly ^(fc) must be pos- 
itive, but occasionally the interpolating spline between 
the knots goes negative, and we must discard such steps 
in the Markov chain. Choosing a lower prior of ten for 
the knot values helps keep the interpolating spline from 
going negative. The "sharp cutoff" PPS reconstruction, 
shown in figure Q, hits a non-negative prior on the PPS 
knot values more strongly, since splines with steep slopes 
are more likely to go negative. It resembles a running 
of the spectral index, showing how our method tends to 
"smooth out" sharp features. 



V. PPS RECONSTRUCTION RESULTS 

We next apply the smoothing spline reconstruction to 
the combination of the first-year WMAP TT and TE 
power spectra 0, |^ . Our aim here is to test the sta- 
tistical significance of possible smooth deviations from 
scale invariance; our "null hypothesis" is a scale-invariant 
(n = 1) PPS. Thus, as a benchmark, we run a Markov 
chain to find the best-fit parameters for such a model. 
Since the volume of parameter space for wiggly power 
spectra is far greater than the volume for fiat spectra, the 
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FIG. 2: The spectral index h{k) of the reconstructed PPS 
from the mock data with a running of as = —0.047 and the 
best-fit values for the WMAP running spectral index model 
for the rest of the cosmological parameters. The reconstruc- 
tion used a smoothing spline with five knots equally spaced 
in In k and no roughness penalty, and an exact likelihood cal- 
culation for £ = 2 ~ 100. The dark line denotes the mean, 
the shaded region denotes one standard deviation, and the 
dashed line denotes the fiducial PPS. 
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FIG. 3: The spectral index h{k) of the reconstructed PPS 
from mock data with a running of Ua = —0.087 and the best- 
fit values for the WMAP running spectral index model for 
the rest of the cosmological parameters. The reconstruction 
used a smoothing spline with five knots equally spaced in In k 
and no roughness penalty. The dark line denotes the mean, 
the shaded region denotes one standard deviation, and the 
dashed line denotes the fiducial PPS. The bend at low k is 
due to the spectrum hitting the prior. 



Markov chain tends towards more curvy (higher F) spec- 
tra than flat ones, due to entropy. One may compensate 
for this effect by increasing A, but it is still very unlikely 
for a Markov chain started from a curved PPS to line up 
the spline's knots to retrieve a totally flat n = 1 spec- 
trum. To minimize burn-in and to make sure our chains 
include the null hypothesis case, we start the smoothing 



FIG. 4: The reconstructed 'P{k) from mock data with a cutoff 
at fc = 0.002 the best-fit values for the WMAP fiat PPS model 
for the rest of the cosmological parameters. The reconstruc- 
tion used a smoothing spline with five knots equally spaced 
in In k and no roughness penalty. The dark line denotes the 
mean, the shaded region denotes one standard deviation, and 
the dashed line denotes the fiducial PPS. 



spline Markov chains from a scale-invariant PPS with 
cosmological parameters set to the n — 1 model's best-fit 
parameters. 

After running several preliminary chains with differ- 
ent numbers of knots and different smoothing parame- 
ters, we settled with three well-mixed, converged Markov 
chains with smoothing splines, one with five knots and 
roughness penalty A = 0.1, one with five knots and 
A — 0,and one with seventeen knots and roughness 
penalty A = 2 x 10^^. We use the roughness parame- 
ter of 2 X lO^'^ for the 17-knot spline because it is very 
tiny constraint on the flexibility of the PPS, but sufficient 
to allow the Markov chain to converge in a reasonable 
amount of time. 

Figure jSJ shows the mean spectral index (Eq.®) and 
one-sigma confldence interval for the WMAP data for a 
smoothing spline with flve knots and a smoothing pa- 
rameter of 0.1. The resulting PPS is indistinguishable 
from flat. Removing the roughness penalty yields the 
recovered PPS shown in figure which has a Ax^ of 
only 1.79 away from flat. Giving the PPS spline a lot 
more freedom, seventeen knots and a smoothing param- 
eter of only 2 x 10~^, one recovers the function in figure 
Q. Although this spline was given much more parame- 
ter space and a much lower A than preferred by the data, 
the one-sigma confidence interval is still consistent with 
a fiat n = 1 spectrum. 

In all these cases, the posterior probability density cre- 
ated by the Markov chains concentrates in regions of low 
roughness penalty. This implies that the data do not 
require the additional fiexibility of the smoothing spline 
reconstruction. For all the steps in all our runs with finite 
A, XF remains on the order of 0.1% to 1% of ln£. This 
means that moving far away from a fiat power spectrum 
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cannot improve the likelihood sufficiently to overcome the 
roughness penalty. 

CV confirms this qualitative interpretation. We ran 
our -y-fold CV for two different sets of fixed cosmological 
parameters, the best-fit flat and the five-knot no-lambda 
best fit, and find an optimum A ^ 0.1 for a five-knot 
spline. Therefore, the PPS should be closer to scale in- 
variant than the five-knot spline with A = results (figure 

and at least as close to scale invariant as the five-knot 
spline with A = 0.1 (figure [SJ. Since increasing the num- 
ber of knots or decreasing A both increase the effective 
degrees of freedom, we can also conclude that the PPS 
should be much closer to scale invariant than the 17-knot 
sphne with A = 2 x 10^^ result (figure [7)). 

Figure © shows a flatter PPS than with the same 
number of knots and same smoothing parameter (A 0). 
Thus, we conclude that if the PPS had a running of |as | > 
0.05 the smoothing spline technique applied to WMAP 
data would have recovered it. The smoothing spline flnds 
no evidence for deviations from a scale invariant power 
spectrum from WMAP flrst-year data. 
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FIG. 6: The spectral index n(fc) of the PPS reconstructed 
using a smoothing spline with five knots equally spaced in 
Infc and no smoothing penalty. The dark line denotes the 
mean and the shaded region denotes one standard deviation. 
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FIG. 5: The spectral index h{k) of the PPS reconstructed 
using a smoothing spline with five knots equally spaced in In k 
and a smoothing parameter of 0.1. The dark line denotes the 
mean and the shaded region denotes one standard deviation. 
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FIG. 7: The spectral index n(fc) of the PPS reconstructed 
using a smoothing spline with 17 knots equally spaced in In k 
and a smoothing parameter of 2 x 10~^. The dark line de- 
notes the mean and the shaded region denotes one standard 
deviation. 



A. Comparing CV with other criteria 

The preference for a very flat PPS, indicated by 
the spline reconstruction and its CV, can be cross- 
checked with alternative model-testing techniques, such 
as Bayesian evidence. 

We apply the Akaike information criterion (AIC) and 
Bayesian information criterion (BIC) to compare the five- 
knot, A = spline with the scale invariant model. These 
methods are improvements on the standard likelihood 
ratio test in that they rely on fewer assumptions 
[33 |. Both criteria add a penalty for additional degrees 
of freedom to the negative log likelihood, so that a lower 



value for this sum indicates the preferred model. The 
AIC ^3 comes from an approximation to the KuUback- 
Leiber information entropy, used to measure the differ- 
ence in goodness of fit between two models. The BIC 
[20l | approximates the Bayes factor, or the ratio of the 
likelihood of two models with a flat initial prior. The 
BIC has a more stringent requirement to accept extra 
parameters than the AIC [s^l- Here we find that both 
criteria prefer the simpler, scale-invariant model; see ta- 
ble (HJ. A difference in BIC of ten or more is considered 
very strong evidence for the preferred model, and here we 
have a difference of twenty-six favoring the flat spectrum 
over the five-knot spline. Even the weaker AIC prefers 
the simpler model by a difference of six. 

Our results are consistent with previous work applying 
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5-knot spline 


minimal (n—l) 


AIC 


1448 


1442 


BIG 


1494 


1468 



TABLE I: The Akaike information criterion and Bayesian in- 
formation criterion applied to the five-knot smoothing spline 
PPS with zero lambda and the minimal cosmological model. 
Since preferred models have lower AIC and BIG, both criteria 
prefer the minimal model. 



Bayesian evidence to determine the significance of the 
spectral index n in the cosmological parameter set (versus 
setting n = 1). Liddle 'sij applies the AIC and BIG 
and finds that the data favor the simpler scale-invariant 
model. Trotta [s^ uses the Laplace and Savage-Dickey 
approximations to the Bayes factor, and finds 2:1 odds 
in favor of the Harrison-Zeldovich spectrum using CMB 
data. Our tests extend these results and compare the 
Harrison-Zeldovich model to shapes with more flexibility 
than the one free parameter n. 



VI. SENSITIVITY OF COSMOLOGICAL 
PARAMETERS ON THE PPS 

Because the measured CMB power spectrum depends 
nonlinearly on both the PPS and the cosmological pa- 
rameters, there are degeneracies between the PPS shape 
and the cosmological parameters. We now investigate 
how the increased freedom in the PPS allowed by the 
spline technique affects the recovered values of the cos- 
mological parameters. Similar analyses yielding quali- 
tatively similar results using multiple CMB data sets, 
large-scale structure, and piecewise and band-power PPS 
reconstructions, can be found in 0,0 • 

Figure ^ (dark lines) shows the cosmological param- 
eter likelihood contours for the one sigma marginalized, 
one sigma joint, and two sigma joint confidence intervals 
for the five-knot spline with no lambda, figure Q shows 
the same for the seventeen- knot spline with A — 2x10"^, 
and figure (|10|) shows the same for the five-knot spline 
with A = 0.1 . For comparison, the thin gray lines in 
each figure show the likelihood contours when the PPS 
is forced to be Harrison-Zeldovich. 

We find that the best-fit values for all parameters ex- 
cept Z do not significantly change as the PPS is allowed 
more freedom compared to the scale-invariant case. The 
error bars increase, however, especially in the 17-knots 
case. 

Table shows the values and errors for the cosmolog- 
ical parameters for our two smoothingspline reconstruc- 
tions compared to the WMAP errors [l| and the minimal 
model (n = 1) errors. The parameter that shows least 
sensitivity to the increased fiexibility in the PPS is flmh?, 
followed by A. These parameters are the least degenerate 
with smooth features in the PPS. The increased freedom 



in the PPS increases the errors significantly (by a factor 
~ 5 for the five-knot case compared to the Harrison- 
Zeldovich case) for ilbh^. Drastic changes are also ob- 
served in the likelihood contours for Z. Following the 
WMAP team [l(|, we have imposed a hard prior on r of 
T < 0.3. While the effect of this prior is not very impor- 
tant for the scale-invariant case, it sharply cuts the likeli- 
hood contours in the spline cases. As additional freedom 
in the PPS is allowed, the recovered best-fit value for r 
is pushed towards the limit imposed by the prior. With 
only the TT and TE power spectra, the optical depth 
is significantly degenerate with the PPS shape. We ex- 
pect the addition of EE polarization power spectrum will 
break this degeneracy. 



VII. CONCLUSIONS 

Cosmological data now offer the ability to go beyond 
finding preferred parameters under a given model, and 
to compare the success and statistical preference for dif- 
ferent models. Here we have used a nonparametric (or 
minimally parametric) method to test a subset of smooth 
PPS models. We have presented smoothing spline recon- 
structions of the PPS using CMB data, complementary 
to other nonparametric reconstructions presented in the 
literature. We applied this technique to WMAP first-year 
data. 

The smoothing spline reconstruction gives the PPS sig- 
nificantly more freedom than a power-law or a running 
spectral index model, and is poised to recover smooth fea- 
tures in the PPS. We find that the spline is not sensitive 
to small, sharp features in the PPS, but if a feature such 
as a running of the spectral index as ^ —0.05 is present 
in WMAP first-year data, the technique can recover it. 

We first reconstructed the PPS without any rough- 
ness penalty with a five-knot spline, thus leaving the 
PPS four extra degrees of freedom compared to a scale- 
invariant n = 1 model. Then, to quantitatively assess if 
the data require extra effective degrees of freedom in the 
PPS, we used three different statistical tests. We first ap- 
plied cross validation, the standard statistical method for 
choosing a smoothing spline's smoothing parameter. We 
found that CV prefers a high smoothing parameter, in- 
dicating a preference for a fiat (scale-invariant) PPS. We 
then compared this result with the AIC and BIC, infor- 
mation criteria for measuring the statistical significance 
of additional parameters in a model. Both the AIC and 
BIC strongly indicate no preference for the extra freedom 
of the smoothing spline. 

Thus, we found no statistically significant indications 
for any deviations from a scale invariant (n = 1) PPS. 
We find that most cosmological parameters, especially 
the matter density f2mj remain fairly robust to flexibil- 
ity in the PPS. However, the constraints on n^h^ and 
to a greater extent on the optical depth r are signifi- 
cantly weakened. We expect that the addition of EE- 
polarization data will break the degeneracy between opti- 
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5-knot spline 17-knot spline power law running minimal {n=l) 

0.021±0.002 0.022±0.004 0.024±0.001 0.023±0.002 0.0240±0.0004 

n^/i^ 0.14±0.02 0.14±0.03 0.14±0.02 0.14±0.02 0.15±0.02 

h 0.67±0.06 0.63l°lo 0.72±0.05 0.70±0.05 0.70±0.05 

r O-'iltm 0.26lo.o8 0.166t;5o7? 0.20±0.07 0.18±0.05 

TABLE II: Cosmological parameters and errors recovered by smoothing spline PPS's versus the WMAP results \^ for a power 
law and running spectral index PPS and an n=l PPS. The values and errors for Sim remain quite independent of PPS flexibility. 
Allowing smooth features in the PPS loosens the constraints on fit, h, and r. * denotes hitting the prior. 



cal depth and the PPS shape. Large-scale structure data, 
which now overlaps the scales of the CMB data, could 
help even more in constraining the cosmological param- 
eters when the PPS is flexible. Here we have shown that 
even using CMB data alone, the cosmological parameters 
are quite robust. Our results are consistent with and 
0. In particular, we find roughly the same increases in 
errors and changes in the values of the cosmological pa- 
rameters as Mukherjee and Wang's ,^ PPS reconstruc- 
tions using 11 wavelet band-powers and top-hat bins and 
CMB data only. The addition of large-scale structure 
data naturally shrinks the errors slightly. 

The smoothing spline is a statistically rigorous tool, 
and has a solid Bayesian justification when applied to di- 
rect measurements of the sought-after function fTsL l36l| . 
We had to modify the smoothing spline method due 
to the complicated transfer function between the CMB 
data and the PPS. With more computing power or faster 
methods to compute the theory (such as CMBwarp 
|37j). we could, in theory, include as many knots as data 
points, and perform one-fold CV to choose the optimum 



smoothing parameter. Such a reconstruction would be 
able to soundly detect features as fine as the data could 
possibly distinguish. 
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penalty. Dark contours denote one sigma marginalized, one sigma joint, and two sigma joint. Over-plotted with thin, gray 
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FIG. 9: Errors on cosmological parameters from a Markov chain run for a smoothing sphne V with 17 knots and and a smoothing 

parameter of 2x10"''. Dark contours denote one sigma marginahzed, one sigrna joint, and two sigma joint. Over-plotted with 
thin, gray contours are the errors for a n = 1 flat PPS for comparison. is the baryon fraction, f2c is the cold dark matter 



density, A is the present angular size of the sound horizon at the last scattering surface, and Z 
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FIG. 10: Errors on cosmological parameters from a Markov chain run for a smoothing spline V with 5 knots and and a 
smoothing parameter of 0.1. Dark contours denote one sigma marginalized, one sigma joint, and two sigma joint. Over-plotted 
with thin, gray contours are the errors for a n = 1 flat PPS for comparison, ilb is the baryon fraction, Oc is the cold dark 
matter density, A is the present angular size of the sound horizon at the last scattering surface, and Z = e~^'^ where r is the 
optical depth. 



